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Art Unit: 2654 

DETAILED ACTION 



Double Patenting 

The nonstatutory double patenting rejection is based on a judicially created 
doctrine grounded in public policy (a policy reflected in the statute) so as to prevent the 
unjustified or improper timewise extension of the "right to exclude" granted by a patent 
and to prevent possible harassment by multiple assignees. See In re Goodman, 1 1 
F.3d 1046, 29 USPQ2d 2010 (Fed. Cir. 1993); In re Long/, 759 F.2d 887, 225 
USPQ 645 (Fed. Cir. 1985); In re Van Ornum, 686 F.2d 937, 214 USPQ 761 (CCPA 
1982); In re Vogel, 422 F.2d 438, 164 USPQ 619 (CCPA 1970); and In re Thorington, 
418 F.2d 528, 163 USPQ 644 (CCPA 1969). 

A timely filed terminal disclaimer in compliance with 37 CFR 1 .321(c) may be 
used to overcome an actual or provisional rejection based on a nonstatutory double 
patenting ground provided the conflicting application or patent is shown to be commonly 
owned with this application. See 37 CFR 1 .130(b). 

Effective January 1 , 1 994, a registered attorney or agent of record may sign a 
terminal disclaimer. A terminal disclaimer signed by the assignee must fully comply with 
37 CFR 3.73(b). 

Claims 1 to 10 are rejected under the judicially created doctrine of obviousness- 
type double patenting as being unpatentable over claims 1 to 26 of U.S. Patent No. 
6,865,536. Although the conflicting claims are not identical, they are not patentably 
distinct from each other because the current claims of the application and the prior 
claims of the patent set forth the same subject matter of two or more clients storing 
audio speech in one or more buffers and a server comprising the capability to receive 
packets from each of the at least two clients. 

A restriction requirement was made in the parent application, Application Serial 
No. 10/199,395, but the current claims of the application do not maintain the line of 
patentable distinctiveness. The current claims merely represent claims elected in the 
parent application, Application Serial No. 10/199,395. 



Application/Control Number: 10/711,114 



Art Unit: 2654 



Page 3 



Claim Rejections - 35 USC § 102 

The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(a) the invention was known or used by others in this country, or patented or described in a printed 
publication in this or a foreign country, before the invention thereof by the applicant for a patent. 

Claims 1 to 3 are rejected under 35 U.S.C. 102(a) as being anticipated by 
Barclay et al. 

Regarding independent claim 1 , Barclay et al discloses a speech recognition 
system, comprising: 

"two or more clients, each client comprising the capability to receive audio 
speech from a user, store the audio speech in one or more buffers, each buffer 
comprising a portion of the received audio speech, encode a buffer of the received 
audio speech before all of the audio speech is received, package the encoded buffer to 
receive audio speech into one or more packets to be transmitted over the internet 
before all of the audio speech is received, and transmit a packet of encoded audio 
speech over the internet before all of the audio speech is received" - a connection 
between the client and the server can be any communication channel including the 
Internet; a client includes a microphone 10 for accepting audio input ("capability to 
receive audio speech from a user") (column 4, line 62 to column 5, line 11: Figure 1); 
quantized feature data is delivered to dispatcher 26 where it may be temporarily 
buffered ("store the audio speech in one or more buffers, each buffer comprising a 
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portion of the received audio speech") (column 5, lines 36 to 64); front-end 12 is a 
program for collecting the digitized speech, extracting a set of features, and quantizing 
those features ("encode a buffer of the received audio speech") (column 5, lines 4 to 
1 1 ); such buffering does not detract from the real-time aspect since the buffering is to 
accommodate timing delays and synchronization that may be needed; the front end 
streams the quantized data to the dispatcher; "stream" is defined to send substantially 
continuously the data in real-time ("before all of the audio speech is received") (column 
5, lines 48 to 55; column 7, lines 13 to 20); a client sends quantized speech data as 
message packets ("package the encoded buffer of received audio speech into one or 
more packets") (column 7, lines 48 to 59); the packets can be forwarded by the client 
dispatcher to the server ("transmit a packet of encoded audio speech") (column 7, lines 
48 to 59); implicitly, a client/server architecture includes a plurality of clients ("two or 
more clients") serviced by a server; 

"a server, said server comprising the capability to receive packets of encoded 
audio speech from at least two clients, decode each of the packets of audio speech and 
store the resultant raw speech into one or more buffers for the respective client, and 
evaluate the resultant raw speech received from each of the at least two clients" - 
server side 4 includes dispatcher 18 that physically receives the quantized features 
("receive packets of encoded audio speech") (column 5, lines 21 to 35: Figure 1 ); server 
receives quantized speech data as message packets ("to receive packets of encoded 
audio speech") (column 7, lines 48 to 59); server dispatcher accepts and buffers 
messages (digitized quantized features) 60 before the recognizer is ready to receive 
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and process the messages ("decode each of the packets of audio speech and store 
resultant raw speech into one or more buffers for the respective client") (column 7, lines 
26 to 40); speech recognizer/decoder 20 recognizes words from the quantized speech 

§ 

features ("evaluate the resultant raw speech received") (column 5, lines 21 to 35); 
implicitly, a client/server architecture enables a server to simultaneously service and 
buffer speech from a plurality of clients ("from each of the at least two clients 
simultaneously"). 

Regarding claims 2 and 3, Barclay et ai discloses that a transcription or text is 
determined by the speech recognizer ("a result of the server's evaluation of the resultant 
raw speech received from the client"), the transcription is returned to the dispatcher, and 
the dispatcher returns the text to the client ("transmit a response to a client"); 
alternatively, the application program receives and understands the request from the 
client and performs the desired function (column 6, lines 5 to 25); a client has a browser 
78 for displaying HTML (column 8, lines 36 to 47: Figure 4), which implicitly involves "a 
display screen". 

Claim Rejections - 35 USC § 103 

The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 
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Claims 4 to 7 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Barclay et al. in view of Moshfeghi et al. 

Concerning claims 4 and 7, Barclay et al. discloses a speech-enabled browser 
for displaying data via a graphical user interface (GUI), where the GUI may include a 
LISTEN button (column 8, lines 48 to 64), but does not expressly disclose a text-to- 
speech engine for converting a text format response to audio data, which is played 
through a speaker. However, text-to-speech engines are generally well known in 
interactive voice response (IVR) systems to provide two-way audio interaction without 
requiring a display screen. Moshfeghi et al. teaches an interactive voice response (IVR) 
system in a client/server architecture, where a client provides a text-to-speech 
synthesizer incorporated into a browser/Java® applet for audible messages to avoid 
visual distraction to the user and to minimize storage requirements. (Column 1 , Lines 
39 to 45; Column 3, Lines 37 to 51 ; Column 4, Lines 25 to 36: Figure 1 ) It would have 
been obvious to one having ordinary skill in the art to incorporate a text-to-speech 
synthesizer into the browser of Barclay et al. as suggested by Moshfeghi et al. for the 
purpose of avoiding visual distraction to the user. 

Concerning claim 5, Barclay et al. discloses a server may provide web pages in 
HTML (column 8, lines 36 to 47), but does not expressly disclose two or more stored 
text formats, where the server selects a stored text format to a client as a result of the 
server's evaluation of the speech data. However, Moshfeghi et al. teaches an 
interactive voice response (IVR) system in a client/server architecture, where a server 
transmits Hypertext Markup Language (HTML) pages in accordance with 

■ 
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personalization information stored in user specific data store. The inclusion of such 
capability information allows the server to limit the use of or number of pixels in 
graphical objects in the HTML pages when the display is low resolution (or text only), or 
the bandwidth is limited so as produce unacceptably long download times. The HTML 
page format is determined at user logon when the user speaks his ID and password 
("as a result of the server's evaluation of the resultant raw speech received from the 
client"). (Column 4, Lines 37 to 65: Figure 1 ) It would have been obvious to one having 
ordinary skill in the art for a server to select a stored text format for a client as 
suggested by Moshfeghi et al. in the browser of Barclay et al. for the purpose of 
accommodating low resolution displays and low bandwidth network connections. 

Concerning claim 6, Barclay et al. discloses client and server exchange 
information as message packets (column 7, lines 48 to 59); the transfer of information 
may be organized in conformance with the TCP/IP protocols (column 5, lines 16 to 20); 
implicitly, an internet operating in accordance with a TCP/IP protocol also partitions text 
into message packets ("the capability to partition a stored text format file into one or 
more packets") (column 7, lines 48 to 59). 

Claims 8 to 13 and 18 to 21 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Barclay et al. in view of Osborne et a/. 

Concerning independent claims 9 and 1 1 , the only elements omitted by Barclay 
et al. are buffers "organized as a linked list" and clients to "write the stored audio speech 
from a first buffer in a first set of buffers to a second buffer in a second set of one or 
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more buffers", where the speech is then packaged and transmitted over the internet 
from the second buffer. However, Osborne et al. teaches a network interface, where it 
is stated that linked lists of buffers are typically used for transmitting from a transmit side 
to a receive side. (Column 3, Lines 59 to 65) In one embodiment, a receive side 
includes a free buffer ring queue 56 ("a first set of one or more buffers") and buffers 64 
and 66 ("a second buffer in a second set of one or more buffers"), where frame data 
received from a network interface is read from free buffer ring queue 56 to either buffer 
64 or buffer 66. (Column 10, Lines 20 to 44: Figure 1 B) Buffers are organized as linked 
lists. (Column 18, Line 58 to Column 19, Line 14: Figure 8) An objective is to ensure 
low overhead and prevent blocking of transmission of frames for other connections. 
(Column 3, Lines 10 to 58) It would have been obvious to one having ordinary skill in 
the art to organize buffers as a linked list and write from first to second buffers as taught 
by Osborne et a/, in the browser of Barclay et al. for the purpose of ensuring low 
overhead and preventing blocking of transmission frames from other connections. 

Concerning claims 8, 18, and 19, Osborne etal. teaches a network interface, 
where it is stated that linked lists of buffers are typically used for transmitting from a 
transmit side to a receive side (column 3, lines 59 to 65); buffers are organized as linked 
lists (column 18, line 58 to column 19, line 14: Figure 8). 

Concerning claims 10 and 21, Barclay etaL discloses digitized speech is 
encoded as cepstra (column 5, lines 2 to 10), which is a compressed format for speech. 

Concerning claims 12 and 13, Barclay et al. discloses that a transcription or text 
is determined by the speech recognizer ("a result of the server's evaluation of the 
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resultant raw speech received from the client"), the transcription is returned to the 
dispatcher, and the dispatcher returns the text to the client ("transmit a response to a 
client"); alternatively, the application program receives and understands the request 
from the client and performs the desired function (column 6, lines 5 to 25); a client has a 
browser 78 for displaying HTML (column 8, lines 36 to 47: Figure 4), which implicitly 
involves "a display screen". 

Concerning claim 20, Osborne et al. teaches a receive side includes a free buffer 
ring queue 56 ("a first set of one or more buffers") and buffers 64 and 66 ("a second 
buffer in a second set of one or more buffers"), where frame data received from a 
network interface is read from free buffer ring queue 56 to either buffer 64 or buffer 66 
(column 10, lines 20 to 44: Figure 1B); implicitly, there are "a predefined number" of 
second buffers, i.e. there are two buffers 64 and 66. 

Claims 14 to 17 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
Barclay et al. in view of Osborne et ai. as applied to claims 1 1 and 12 above, and further 
in view of Moshfeghi et al. 

Similar considerations apply to claims 14 to 17 as to claims 4 to 7, previously 
discussed. 

Conclusion 

The prior art made of record and not relied upon is considered pertinent to 
Applicant's disclosure. 
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Kao, Staats et al., and Dunn et al. disclose related art. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Martin Lerner whose telephone number is (571 ) 272- 
7608. The examiner can normally be reached on 8:30 AM to 6:00 PM Monday to 
Thursday. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on (571) 272-7602. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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